Abstract. Given an $n$-vertex digraph $D = (V, A)$, the Max-$k$-Ordering problem
is to compute a labeling $\ell : V \to [k]$ maximizing the number of forward edges, i.e.
edges $(u, v)$ such that $\ell(u) < \ell(v)$. For different values of $k$, this reduces to Maximum
Acyclic Subgraph ($k = n$) and Max-DiCut ($k = 2$). This work studies the approximability
of Max-$k$-Ordering and its generalizations, motivated by their applications
to job scheduling with soft precedence constraints. We give an LP rounding based
2-approximation algorithm for Max-$k$-Ordering for any $k \in \{2,\dots,n\}$, improving
on the known $2k/(k-1)$-approximation obtained via random assignment. The tightness
of this rounding is shown by proving that for any $k \in \{2,\dots,n\}$ and constant
$\varepsilon > 0$, Max-$k$-Ordering has an LP integrality gap of $2 - \varepsilon$ for $n^{\Omega(1/\log\log k)}$ rounds
of the Sherali-Adams hierarchy.

A further generalization of Max-$k$-Ordering is the restricted maximum acyclic subgraph
problem or RMAS, where each vertex $v$ has a finite set of allowable labels
$S_v \subseteq \mathbb{Z}^+$. We prove an LP rounding based $4\sqrt{2}/(\sqrt{2}+1) \approx 2.344$ approximation
for it, improving on the $2\sqrt{2} \approx 2.828$ approximation recently given by Grandoni et
al. [7]. In fact, our approximation algorithm also works for a general version where
the objective counts the edges which go forward by at least a positive offset specific
to each edge.

The minimization formulation of digraph ordering is DAG edge deletion or DED($k$),
which requires deleting the minimum number of edges from an $n$-vertex directed
acyclic graph (DAG) to remove all paths of length $k$. We show that both the LP
relaxation and a local ratio approach for DED($k$) yield a $k$-approximation for any $k \in [n]$.
A vertex deletion version was studied earlier by Paik et al. [17] and Svensson [18].


1 Introduction 

One of the most well studied combinatorial problems on directed graphs (digraphs) is the
Maximum Acyclic Subgraph problem (MAS): given an $n$-vertex digraph, find a subgraph with the
maximum number of edges containing no directed cycles. An equivalent formulation of MAS
is to obtain a linear ordering of the vertices which maximizes the number of directed edges
going forward. A natural generalization is Max-$k$-Ordering, where the goal is to compute
the best $k$-ordering, i.e. a labeling of the vertices from $[k] = \{1,\dots,k\}$ ($2 \le k \le n$), which
maximizes the number of directed edges going forward in this ordering. It can be seen,
and we show this formally, that Max-$k$-Ordering is equivalent to finding the maximum
subgraph which has no directed cycles and no directed paths of length $k$. Note that MAS
is the special case of Max-$k$-Ordering when $k = n$, and for $k = 2$ Max-$k$-Ordering
reduces to the well known Max-DiCut problem.

A related problem is the Restricted Maximum Acyclic Subgraph problem or RMAS, in
which each vertex $v$ of the digraph has to be assigned a label from a finite set $S_v \subseteq \mathbb{Z}^+$
to maximize the number of edges going forward in this assignment. Khandekar et al. [10]
introduced RMAS in the context of graph pricing problems and its approximability has
recently been studied by Grandoni et al. [7]. A further generalization is OffsetRMAS,
where each edge $e = (u, v)$ has an offset $o_e \in \mathbb{Z}^+$ and is satisfied by a labeling $\ell$ if $\ell(u) + o_e \le \ell(v)$.
Note that when all offsets are unit OffsetRMAS reduces to RMAS, which in turn reduces
to Max-$k$-Ordering when $S_v = [k]$ for all vertices $v$.

This study focuses on the approximability of Max-$k$-Ordering and its generalizations
and is motivated by their applicability in scheduling jobs with soft precedences under a
hard deadline. Consider the following simple case of discrete time scheduling: given $n$ unit
length jobs with precedence constraints and an infinite capacity machine, find a schedule
so that all the jobs are completed by timestep $k$. Since it may not be feasible to satisfy all
the precedence constraints, the goal is to satisfy the maximum number. This is equivalent
to Max-$k$-Ordering on the corresponding precedence digraph. One can generalize this
setting to each job having a set of allowable timesteps when it can be scheduled. This can
be abstracted as RMAS, and a further generalization to each precedence having an associated
lag between the start-times yields OffsetRMAS as the underlying optimization problem.

Also of interest is the minimization version of Max-$k$-Ordering on directed acyclic
graphs (DAGs). We refer to it as DAG edge deletion or DED($k$), where the goal is to delete
the minimum number of directed edges from a DAG so that the remaining digraph does
not contain any path of length $k$. Note that the problem for arbitrary $k$ does not admit any
approximation factor on general digraphs, since even detecting whether a digraph has a path
of length $k$ is the well studied NP-hard longest path problem. A vertex deletion formulation
of DED($k$) was introduced as an abstraction of certain VLSI design and communication
problems by Paik et al. [17], who gave efficient algorithms for it on special cases of DAGs and
proved it to be NP-complete in general. More recently, its connection to project scheduling
was noted by Svensson [18], who proved inapproximability results for the vertex deletion
version.

The rest of this section gives a background of previous related work, describes our results, 
and provides an overview of the techniques used. 


1.1 Related Work 

It is easy to see that MAS admits a trivial 2-approximation, by taking any linear ordering or
its reverse, and this is also obtained by a random ordering. For Max-$k$-Ordering the random
$k$-ordering yields a $2k/(k-1)$-approximation for any $k \in \{2,\dots,n\}$. For $k = 2$, which
is Max-DiCut, the semidefinite programming (SDP) relaxation has been shown to yield a $\approx 1.144$-approximation,
improving upon previous analyses, including that of [5]. As mentioned
above, RMAS is a generalization of Max-$k$-Ordering, and a $2\sqrt{2}$-approximation for it
based on linear programming (LP) rounding was shown recently by Grandoni et al. [7],
which is also the best approximation for Max-$k$-Ordering for $k = 3$. For $4 \le k \le n - 1$, to
the best of our knowledge the proven approximation factor for Max-$k$-Ordering remains
$2k/(k-1)$.

On the hardness side, Newman [16] showed that MAS is NP-hard to approximate within
a factor of 66/65. Assuming Khot's [11] Unique Games Conjecture (UGC), Guruswami et
al. [5] gave a $(2 - \varepsilon)$-inapproximability for any $\varepsilon > 0$. Note that Max-DiCut is at least as
hard as Max-Cut. Thus, for $k = 2$, Max-$k$-Ordering is NP-hard to approximate within
factor $(13/12 - \varepsilon)$ [S], and within factor 1.1382 assuming the UGC [12]. For larger constants
$k$, the result of Guruswami et al. [5] implicitly shows a $(2 - o_k(1))$-inapproximability for
Max-$k$-Ordering, assuming the UGC.

For the vertex deletion version of DED($k$), Paik et al. [17] gave linear time and quadratic
time algorithms for rooted trees and series-parallel graphs respectively. The problem reduces
to vertex cover on $k$-uniform hypergraphs for any constant $k$, thereby admitting a
$k$-approximation, and a matching $(k - \varepsilon)$-inapproximability assuming the UGC was obtained
by Svensson [18].

1.2 Our Results 

The main algorithmic result of this paper is the following improved approximation guarantee
for Max-$k$-Ordering.

Theorem 1. There exists a polynomial time 2-approximation algorithm for Max-$k$-Ordering
on $n$-vertex weighted digraphs for any $k \in \{2,\dots,n\}$.

The above approximation is obtained by appropriately rounding the standard LP relaxation
of the CSP formulation of Max-$k$-Ordering. For small values of $k$ this yields a significant
improvement on the previously known approximation factors: $2\sqrt{2}$ for $k = 3$ (implicit in
[7]), 8/3 for $k = 4$, and 2.5 for $k = 5$. The latter two factors follow from the previous best
$2k/(k-1)$-approximation given by a random $k$-ordering for $4 \le k \le n - 1$. The detailed
proof of Theorem 1 is given in Section 3.

Using an LP rounding approach similar to Theorem 1, in Section 4 we show the following
improved approximation for OffsetRMAS, which implies the same for RMAS. Our result
improves the previous $2\sqrt{2} \approx 2.828$-approximation for RMAS obtained by Grandoni et
al. [7].

Theorem 2. There exists a polynomial time $4\sqrt{2}/(\sqrt{2}+1) \approx 2.344$ approximation algorithm
for OffsetRMAS on weighted digraphs.

Our next result gives a lower bound that matches the approximation obtained in Theorem 1.
In Section 5 we show that even after strengthening the LP relaxation of Max-$k$-Ordering
with a large number of rounds of the Sherali-Adams hierarchy, its integrality
gap remains close to 2, and hence Theorem 1 is tight.

Theorem 3. For any small enough constant $\varepsilon > 0$, there exists $\gamma > 0$ such that for
Max-$k$-Ordering on $n$-vertex weighted digraphs and any $k \in \{2,\dots,n\}$, the LP relaxation with
$n^{\gamma/\log\log k}$ rounds of Sherali-Adams constraints has a $(2 - \varepsilon)$ integrality gap.

For DED($k$) on DAGs we prove in Section 6 the following approximation for any $k$, not
necessarily a constant.

Theorem 4. The standard LP relaxation for DED($k$) on $n$-vertex DAGs can be solved in
polynomial time for $k \in \{2,\dots,n-1\}$ and yields a $k$-approximation. The same approximation
factor is also obtained by a combinatorial algorithm.

We complement the above by showing in Section 6.3 a $(\lfloor k/2 \rfloor - \varepsilon)$ hardness factor for
DED($k$) via a simple gadget reduction from Svensson's [18] $(k - \varepsilon)$-inapproximability for
the vertex deletion version for constant $k$, assuming the UGC.

1.3 Overview of Techniques

The approximation algorithms we obtain for Max-$k$-Ordering and its generalizations are
based on rounding the standard LP relaxation for the instance. Max-$k$-Ordering is viewed
as a constraint satisfaction problem (CSP) over alphabet $[k]$, and the corresponding LP
relaxation has $[0,1]$-valued variables $x^v_i$ for each vertex $v$ and label $i \in [k]$, and $y^e_{ij}$ for each
edge $e = (u, v)$ and pair of labels $i$ and $j$ assigned to $u$ and $v$ respectively. We show that a generalization
of the rounding algorithm used by Trevisan for approximating $q$-ary boolean CSPs
yields a 2-approximation in our setting. The key ingredient in the analysis is a lower bound
on a certain product of the $\{x^u_i\}$, $\{x^v_j\}$ variables corresponding to the end points of an edge
$e = (u, v)$ in terms of the $\{y^e_{ij}\}$ variables for that edge. This improves a weaker bound shown
by Grandoni et al. [7]. For OffsetRMAS, a modification of this rounding algorithm yields
the improved approximation.

The construction of the integrality gap for the LP augmented with Sherali-Adams constraints
for Max-$k$-Ordering begins with a simple integrality gap instance for the basic
LP relaxation. This instance is appropriately sparsified to ensure that subgraphs of polynomially
large (but bounded) size are tree-like. On trees, it is easy to construct a distribution
over labelings from $[k]$ to the vertices (thought of as $k$-orderings), such that the marginal
distribution on each vertex is uniform over $[k]$ and a large fraction of edges are satisfied in
expectation. Using this along with the sparsification allows us to construct distributions for
each bounded subgraph, i.e. good local distributions. Finally, a geometric embedding of the
marginals of these distributions followed by Gaussian rounding yields modified local distributions
which are consistent on the common vertex sets. These distributions correspond to
an LP solution with a high objective value, for a large number of rounds of Sherali-Adams
constraints. Our construction follows the approach in a recent work of Lee, which is
based on earlier works of Arora et al. [1] and Charikar et al. [3].

For the DED($k$) problem, the approximation algorithms stated in Theorem 4 are obtained
using the acyclicity of the input DAG. In particular, we show that both the LP
rounding and the local ratio approach can be implemented in polynomial time on DAGs,
yielding $k$-approximate solutions.


2 Preliminaries 

This section formally defines the problems studied in this paper. We begin with
Max-$k$-Ordering.

Definition 1. Max-$k$-Ordering: Given an $n$-vertex digraph $D = (V, A)$ with a non-negative
weight function $w : A \to \mathbb{R}^+$, and an integer $2 \le k \le n$, find a labeling of the
vertices $\ell : V \to [k]$ that maximizes the weighted fraction of edges $e = (u, v) \in A$ such that
$\ell(u) < \ell(v)$, i.e. forward edges.

It can be seen that Max-$k$-Ordering is equivalent to the problem of computing the maximum
weighted subgraph of $D$ which is acyclic and does not contain any directed path of
length $k$. The following lemma implies this equivalence.

Lemma 1. Given a digraph $D = (V, A)$, there exists a labeling $\ell : V \to [k]$ with each edge
$e = (u, v) \in A$ satisfying $\ell(u) < \ell(v)$, if and only if $D$ is acyclic and does not contain any
directed path of length $k$.

Proof. If such a labeling $\ell$ exists then every edge is directed from a lower labeled vertex to
a higher labeled one. Thus, there are no directed cycles in $D$. Furthermore, any directed
path in $D$ has at most $k$ vertices on it, and is of length at most $k - 1$. On the other hand,
if $D$ satisfies the second condition in the lemma, then choose $\ell(v)$ for any vertex $v$ to be
$t_v + 1$, where $t_v$ is the length of the longest path from any source to $v$. It is easy to see that
$\ell(v) \in [k]$ and for each edge $(u, v)$, $\ell(u) < \ell(v)$. □
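
The constructive direction of the proof can be sketched in code. The following is a minimal illustration, not part of the paper: the function name `longest_path_labeling` and the edge-list input format are assumptions made for this example. It computes $\ell(v) = t_v + 1$ by processing vertices in topological order.

```python
from collections import defaultdict, deque

def longest_path_labeling(vertices, edges):
    """Return {v: t_v + 1}, where t_v is the number of edges on the longest
    directed path ending at v. Raises ValueError on a directed cycle."""
    succ = defaultdict(list)
    indeg = {v: 0 for v in vertices}
    for u, v in edges:
        succ[u].append(v)
        indeg[v] += 1
    label = {v: 1 for v in vertices}          # sources get label 1
    queue = deque(v for v in vertices if indeg[v] == 0)
    processed = 0
    while queue:
        u = queue.popleft()                   # label[u] is final here
        processed += 1
        for v in succ[u]:
            label[v] = max(label[v], label[u] + 1)
            indeg[v] -= 1
            if indeg[v] == 0:
                queue.append(v)
    if processed != len(vertices):
        raise ValueError("digraph contains a directed cycle")
    return label

labels = longest_path_labeling([1, 2, 3, 4], [(1, 2), (2, 3), (1, 3), (3, 4)])
```

If the largest label returned is at most $k$, the digraph has no directed path of length $k$, and every edge goes forward under the labeling.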

The generalizations of Max-$k$-Ordering studied in this work, viz. RMAS and OffsetRMAS,
are defined as follows.

Definition 2. OffsetRMAS: The input is a digraph $D = (V, A)$ with a finite subset
$S_v \subseteq \mathbb{Z}^+$ of labels for each vertex $v \in V$, a non-negative weight function $w : A \to \mathbb{R}^+$, and
offsets $o_e \in \mathbb{Z}^+$ for each edge $e \in A$. A labeling $\ell$ of $V$ such that $\ell(v) \in S_v$ for all $v \in V$ satisfies an
edge $e = (u, v)$ if $\ell(u) + o_e \le \ell(v)$. The goal is to compute a labeling that maximizes the
weighted fraction of satisfied edges. RMAS is the special case when each offset is unit.

As mentioned earlier, DED($k$) is not approximable on general digraphs. Therefore, we
define it only on DAGs.

Definition 3. DED($k$): Given a DAG $D = (V, A)$ with a non-negative weight function
$w : A \to \mathbb{R}^+$, and an integer $2 \le k \le n - 1$, find a minimum weight set of edges $F \subseteq A$ such
that $(V, A \setminus F)$ does not contain any path of length $k$.

The rest of this section describes the LP relaxations for Max-$k$-Ordering and OffsetRMAS
studied in this paper.


2.1 LP Relaxation for Max-$k$-Ordering

From Definition 1, an instance $\mathcal{I}$ of Max-$k$-Ordering is given by $D = (V, A)$, $k$, and $w$.
Viewing it as a CSP over label set $[k]$, the standard LP relaxation given in Figure 1 is defined
over variables $x^v_i$ for each vertex $v$ and label $i$, and $y^e_{ij}$ for each edge $e = (u, v)$ and labels $i$
to $u$ and $j$ to $v$.

Sherali-Adams Constraints. Let $z^S_\sigma \in [0,1]$ be a variable corresponding to a subset $S$ of
vertices and a labeling $\sigma : S \to [k]$. The LP relaxation in Figure 1 can be augmented with $r$
rounds of Sherali-Adams constraints, which are defined over the variables $\{z^S_\sigma \mid 1 \le |S| \le r + 1\}$.
The additional constraints are given in Figure 2. The Sherali-Adams variables define,
for each subset $S$ of at most $(r + 1)$ vertices, a distribution over the possible labelings from
$[k]$ to the vertices in $S$. The constraints given by Equation (7) ensure that these distributions
are consistent across subsets. Additionally, Equations (9) and (10) ensure the consistency
of these distributions with the variables of the standard LP relaxation given in Figure 1.

LP Relaxation for RMAS and OffsetRMAS. The LP relaxation for RMAS is a
generalization of the one in Figure 1 for Max-$k$-Ordering and we omit a detailed definition.
Let $S = \bigcup_{v \in V} S_v$ denote the set of all labels. For convenience, we define variables
$\{x^v_i \mid v \in V, i \in S\}$ and $\{y^e_{ij} \mid e = (u, v) \in A,\ i, j \in S\}$ and force the infeasible assignments to be zero,
i.e. $x^v_i = 0$ for $i \notin S_v$. The other constraints are modified accordingly. For OffsetRMAS,
an additional change is that the contribution to the objective from each edge $e = (u, v)$ is
$\sum_{i \in S_u,\, j \in S_v,\, i + o_e \le j} y^e_{ij}$.
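
\[ \max \sum_{e \in A} w(e) \cdot \sum_{i, j \in [k],\ i < j} y^e_{ij} \tag{1} \]

subject to,

\[ \forall v \in V: \quad \sum_{i \in [k]} x^v_i = 1, \tag{2} \]

\[ \forall e = (u, v) \in A \text{ and } i \in [k]: \quad \sum_{j \in [k]} y^e_{ij} = x^u_i, \tag{3} \]

\[ \forall e = (u, v) \in A \text{ and } j \in [k]: \quad \sum_{i \in [k]} y^e_{ij} = x^v_j, \tag{4} \]

\[ \forall v \in V \text{ and } i \in [k]: \quad x^v_i \ge 0, \tag{5} \]

\[ \forall e \in A \text{ and } i, j \in [k]: \quad y^e_{ij} \ge 0. \tag{6} \]

Fig. 1. LP Relaxation for instance $\mathcal{I}$ of Max-$k$-Ordering.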


3 A 2-Approximation for Max-$k$-Ordering


This section proves the following theorem that implies Theorem 1.

Theorem 5. Let $\{x^v_i\}$, $\{y^e_{ij}\}$ denote an optimal solution to the LP in Figure 1. Let $\ell : V \to [k]$
be a randomized labeling obtained by independently assigning to each vertex $v$ label $i$ with
probability $1/(2k) + x^v_i/2$. Then, for any edge $e = (u, v)$,

\[ \Pr[\ell(u) < \ell(v)] \ge \frac{1}{2} \Big( \sum_{i < j} y^e_{ij} \Big). \]
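
The rounding in Theorem 5 can be illustrated with a short sketch. This is an illustration under assumptions, not the paper's implementation: the helper names are hypothetical, and the marginals `x_u`, `x_v` are assumed to be given as length-$k$ lists taken from some LP solution (labels are 0-indexed inside the lists and returned 1-indexed by the sampler).

```python
import random

def rounding_probs(x_v, k):
    """Label distribution for one vertex: Pr[l(v) = i] = 1/(2k) + x_v[i]/2."""
    return [1.0 / (2 * k) + x_v[i] / 2.0 for i in range(k)]

def forward_probability(x_u, x_v, k):
    """Exact Pr[l(u) < l(v)] when u and v are labeled independently."""
    pu, pv = rounding_probs(x_u, k), rounding_probs(x_v, k)
    return sum(pu[i] * pv[j] for i in range(k) for j in range(i + 1, k))

def sample_label(x_v, k, rng=random):
    """Draw one label in {1, ..., k} from the rounding distribution."""
    return rng.choices(range(1, k + 1), weights=rounding_probs(x_v, k))[0]

# Integral example with k = 3: u puts all mass on label 1, v on label 3,
# so the edge's LP contribution is z_e = 1.
p = forward_probability([1.0, 0.0, 0.0], [0.0, 0.0, 1.0], 3)
```

In this example the edge is forward with probability $2/3$, consistent with the guarantee of at least $z_e/2 = 1/2$.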




To analyze the rounding given above, we need the following key lemma that bounds the sum 
of products of row and column sums of a matrix in terms of the matrix entries. It improves 
a weaker bound shown by Grandoni et al. [7] and also generalizes to arbitrary offsets. 


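Lemma 2. Let $A = [a_{ij}]$ be a $k \times k$ matrix with non-negative entries. Let $r_i = \sum_j a_{ij}$ and
$c_j = \sum_i a_{ij}$ denote the sum of entries in the $i$-th row and $j$-th column respectively, and let
$1 \le \theta \le k - 1$ be an integer offset. Then,

\[ \sum_{i + \theta \le j} r_i c_j \ \ge\ \frac{k - \theta + 1}{2(k - \theta)} \Big( \sum_{i + \theta \le j} a_{ij} \Big)^2. \tag{11} \]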
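
As a sanity check on inequality (11), both sides can be evaluated numerically. The sketch below is only an illustration (the function names are made up for this example, and indices are 0-based, which does not change the condition $i + \theta \le j$); it does not replace the proof.

```python
import random

def lemma2_sides(a, theta):
    """Return (lhs, rhs) of inequality (11) for a square non-negative matrix a."""
    k = len(a)
    r = [sum(row) for row in a]                              # row sums
    c = [sum(a[i][j] for i in range(k)) for j in range(k)]   # column sums
    pairs = [(i, j) for i in range(k) for j in range(k) if i + theta <= j]
    lhs = sum(r[i] * c[j] for i, j in pairs)
    s = sum(a[i][j] for i, j in pairs)
    rhs = (k - theta + 1) / (2.0 * (k - theta)) * s * s
    return lhs, rhs

def check_random_matrices(trials, seed):
    """Test (11) on random non-negative matrices; True if it always holds."""
    rng = random.Random(seed)
    for _ in range(trials):
        k = rng.randint(2, 6)
        theta = rng.randint(1, k - 1)
        a = [[rng.random() for _ in range(k)] for _ in range(k)]
        lhs, rhs = lemma2_sides(a, theta)
        if lhs < rhs - 1e-9:
            return False
    return True

# A 3 x 3 instance with a_12 = a_23 = 1/2 and theta = 1, where (11) is tight.
tight = lemma2_sides([[0, 0.5, 0], [0, 0, 0.5], [0, 0, 0]], 1)
```

For the tight instance both sides equal $3/4$, which shows the constant in (11) cannot be improved for $k = 3$, $\theta = 1$.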
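
\[ \forall S \subset T \subseteq V,\ 1 \le |S|, |T| \le r + 1, \text{ and } \sigma : S \to [k]: \quad z^S_\sigma = \sum_{\rho : T \to [k],\ \rho|_S = \sigma} z^T_\rho, \tag{7} \]

\[ \forall S \subseteq V,\ 1 \le |S| \le r + 1, \text{ and } \sigma : S \to [k]: \quad 0 \le z^S_\sigma \le 1, \tag{8} \]

\[ \forall v \in V \text{ and } \sigma : \{v\} \to [k] \text{ s.t. } \sigma(v) = i: \quad z^{\{v\}}_\sigma = x^v_i, \tag{9} \]

\[ \forall e = (u, v) \in A \text{ and } \sigma : \{u, v\} \to [k] \text{ s.t. } (\sigma(u), \sigma(v)) = (i, j): \quad z^{\{u,v\}}_\sigma = y^e_{ij}. \tag{10} \]

Fig. 2. $r$-round Sherali-Adams constraints for the LP relaxation in Figure 1.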


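Proof. The LHS of the above is simplified as,

\[ \sum_{i + \theta \le j} r_i c_j = \sum_{i + \theta \le j} \Big( \sum_{j'} a_{ij'} \Big) \Big( \sum_{i'} a_{i'j} \Big) \tag{12} \]

\[ \ge \sum_{x + \theta \le y} a_{xy}^2 \ +\ 2 \cdot \sum_{\substack{x + \theta \le y \\ x + \theta \le y' \\ y < y'}} a_{xy} a_{xy'} \ +\ \sum_{\substack{x + \theta \le y \\ x' + \theta \le y' \\ x < x'}} a_{xy} a_{x'y'}, \tag{13} \]

where all the indices above are in $[k]$. Note that (13) follows from (12) because:

(i) For any $x + \theta \le y$, $a_{xy}^2$ appears in the RHS of (12) when $i = x$ and $j = y$.

(ii) For $x + \theta \le y$ and $x + \theta \le y'$, $a_{xy} a_{xy'}$ appears in the RHS of (12) both when $i = x, j = y$
and when $i = x, j = y'$.

(iii) For any $x + \theta \le y$ and $x' + \theta \le y'$ (say $x < x'$), it must be that $x + \theta \le y'$, and hence
$a_{xy} a_{x'y'}$ appears in the RHS of (12) when $i = x$ and $j = y'$.

Thus, we obtain,

\[ \sum_{i + \theta \le j} r_i c_j \ \ge\ \Big( \sum_{x + \theta \le y} a_{xy} \Big)^2 - \sum_{\substack{x + \theta \le y \\ x' + \theta \le y' \\ x < x'}} a_{xy} a_{x'y'}. \tag{14} \]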
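
Therefore, it is sufficient to show that

\[ \sum_{\substack{x + \theta \le y \\ x' + \theta \le y' \\ x < x'}} a_{xy} a_{x'y'} \ \le\ \frac{k - \theta - 1}{2(k - \theta)} \Big( \sum_{x + \theta \le y} a_{xy} \Big)^2. \tag{15} \]

Substituting,

\[ \Big( \sum_{x + \theta \le y} a_{xy} \Big)^2 = \sum_{x + \theta \le y} a_{xy}^2 \ +\ 2 \cdot \sum_{\substack{x + \theta \le y \\ x + \theta \le y' \\ y < y'}} a_{xy} a_{xy'} \ +\ 2 \cdot \sum_{\substack{x + \theta \le y \\ x' + \theta \le y' \\ x < x'}} a_{xy} a_{x'y'}, \]

and simplifying, inequality (15) can be rewritten as,

\[ \sum_{x + \theta \le y} a_{xy}^2 \ +\ 2 \sum_{\substack{x + \theta \le y \\ x + \theta \le y' \\ y < y'}} a_{xy} a_{xy'} \ -\ \frac{2}{k - \theta - 1} \sum_{\substack{x + \theta \le y \\ x' + \theta \le y' \\ x < x'}} a_{xy} a_{x'y'} \ \ge\ 0, \tag{16} \]

\[ \Leftrightarrow \quad a^T M a \ge 0, \tag{17} \]

where $a \in \mathbb{R}^Z$, $Z := \{(x, y) \mid x + \theta \le y \text{ and } x, y \in [k]\}$ with $a_{(x,y)} := a_{xy}$, and $M \in \mathbb{R}^{Z \times Z}$
is a symmetric matrix defined as follows:

\[ M_{(x,y),(x',y')} = \begin{cases} 1 & \text{if } (x, y) = (x', y'), \\ 1 & \text{if } x' = x \text{ and } y \ne y', \\ -1/(k - \theta - 1) & \text{if } x \ne x'. \end{cases} \tag{18} \]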


To complete the proof of the lemma we show that $M$ is positive semidefinite. Consider the
set of unit vectors $\{v_x \mid 1 \le x \le k - \theta\}$ given by the normalized corner points of the
$(k - \theta - 1)$-dimensional simplex centered at the origin. It is easy to see (e.g. Lemma
3 of [5]) that $\langle v_x, v_{x'} \rangle = -1/(k - \theta - 1)$ if $x \ne x'$. Thus, $M = L^T L$, where $L$ is a matrix
whose columns are indexed by $Z$ such that the $(x, y)$ column is $v_x$. Therefore, $M$ is positive
semidefinite. □
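
The inner-product fact used above is easy to confirm numerically. The sketch below uses one standard construction (an assumption of this example, not taken from the paper): the $m$ corners $e_x - \frac{1}{m}\mathbf{1}$ of a regular simplex embedded in $\mathbb{R}^m$, normalized to unit length, with $m = k - \theta \ge 2$.

```python
import math

def simplex_corners(m):
    """m unit vectors in R^m: normalized corners e_x - (1/m)*1 of a regular
    simplex centered at the origin (requires m >= 2)."""
    vecs = []
    for x in range(m):
        v = [(1.0 if t == x else 0.0) - 1.0 / m for t in range(m)]
        norm = math.sqrt(sum(c * c for c in v))
        vecs.append([c / norm for c in v])
    return vecs

def dot(p, q):
    """Standard inner product of two equal-length vectors."""
    return sum(a * b for a, b in zip(p, q))

corners = simplex_corners(4)   # corresponds to k - theta = 4
```

Distinct corners have inner product $-1/(m-1)$ and each has unit norm, which matches the entries of $M$ in (18).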


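Proof (of Theorem 5). For brevity, let $z_e = \sum_{i<j} y^e_{ij}$ denote the contribution of the edge $e$
to the LP objective. From the definition of the rounding procedure we have,

\[ \Pr[\ell(u) < \ell(v)] = \sum_{i<j} \Pr[\ell(u) = i] \Pr[\ell(v) = j] = \sum_{i<j} \Big( \frac{1}{2k} + \frac{x^u_i}{2} \Big) \Big( \frac{1}{2k} + \frac{x^v_j}{2} \Big) \]

\[ = \frac{1}{4} \Big[ \frac{k-1}{2k} + \frac{1}{k} \sum_{i<j} (x^u_i + x^v_j) + \sum_{i<j} x^u_i x^v_j \Big]. \]

We can now apply Lemma 2 to the $k \times k$ matrix $[y^e_{ij}]$. The LP constraints guarantee that
$r_i = x^u_i$ and $c_j = x^v_j$ are equal to the row and column sums respectively. Further, substituting
offset $\theta = 1$, we obtain

\[ \Pr[\ell(u) < \ell(v)] \ \ge\ \frac{1}{4} \Big[ \frac{k-1}{2k} + \frac{1}{k} \sum_{i<j} (x^u_i + x^v_j) + \frac{k}{2(k-1)} z_e^2 \Big]. \tag{19} \]

On the other hand,

\[ \sum_{i<j} (x^u_i + x^v_j) = \sum_{i=1}^{k-1} (k - i)\, x^u_i + \sum_{j=2}^{k} (j - 1)\, x^v_j \ \ge\ \sum_{i=1}^{k-1} (k - i) \sum_{j' > i} y^e_{ij'} + \sum_{j=2}^{k} (j - 1) \sum_{i' < j} y^e_{i'j}. \tag{20} \]

For $a < b$, $y^e_{ab}$ appears $(k - a)$ times in the RHS of the above inequality when $i = a$, and
$(b - 1)$ times when $j = b$. Since $k - a + b - 1 \ge k$, we obtain that the RHS of Equation (20) is
lower bounded by $k \sum_{a<b} y^e_{ab} = k z_e$. Substituting back into Equation (19) and simplifying
gives us that $\Pr[\ell(u) < \ell(v)]$ is at least,

\[ \frac{1}{4} \Big[ \frac{k-1}{2k} + z_e + \frac{k z_e^2}{2(k-1)} \Big] = \frac{z_e}{4} \Big[ 1 + \frac{1}{2} \Big( \frac{k-1}{k z_e} + \frac{k z_e}{k-1} \Big) \Big] \ \ge\ \frac{z_e}{4} (1 + 1) = \frac{z_e}{2}, \tag{21} \]

where we use $t + 1/t \ge 2$ for $t > 0$. □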

4 Approximation for OffsetRMAS 


Let $D = (V, A)$, $\{S_v\}_{v \in V}$, $w$, and $\{o_e\}_{e \in A}$ constitute an instance of OffsetRMAS as given
in Definition 2. Without loss of generality, one can assume that for each edge $e = (u, v) \in A$,
$\min(S_u) + o_e \le \max(S_v)$; otherwise no feasible solution can satisfy $e$ and that edge can
be removed. A simple randomized strategy that independently assigns each vertex $v$ either
$\ell^v_{\min} := \min(S_v)$ or $\ell^v_{\max} := \max(S_v)$ with equal probability is a 4-approximation. The
recent work of Grandoni et al. [7] shows that combining this randomized scheme with an
appropriate LP-rounding yields a $2\sqrt{2} \approx 2.828$ approximation algorithm for RMAS.
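
The simple randomized strategy can be sketched as follows. The data layout is an assumption of this example (a dict from vertex to label set, and edges as `(u, v, offset, weight)` tuples); the function names are hypothetical.

```python
import random

def min_max_labeling(label_sets, rng=random):
    """Assign each vertex min(S_v) or max(S_v), each with probability 1/2."""
    return {v: rng.choice((min(s), max(s))) for v, s in label_sets.items()}

def satisfied_weight(edges, labeling):
    """Total weight of edges (u, v, o_e, w) with l(u) + o_e <= l(v)."""
    return sum(w for u, v, o, w in edges if labeling[u] + o <= labeling[v])

label_sets = {"a": {1, 3}, "b": {2, 5}}
edges = [("a", "b", 1, 2.0)]
labeling = min_max_labeling(label_sets, random.Random(0))
```

Each edge with $\min(S_u) + o_e \le \max(S_v)$ is satisfied at least when $u$ receives its minimum and $v$ its maximum label, which happens with probability $1/4$; this is where the factor 4 comes from.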

We show that a variant of the rounding scheme developed in Section 3 yields an improved
approximation factor for OffsetRMAS. In particular, we prove the following theorem,
which implies Theorem 2.

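Theorem 6. Let $\{x^v_i\}$, $\{y^e_{ij}\}$ denote an optimal solution to the linear programming relaxation
of OffsetRMAS described in Section 2. Let $\ell$ be a randomized labeling obtained by
independently assigning labels to each vertex $v$ with the following probabilities:

\[ \Pr[\ell(v) = i] = \begin{cases} \dfrac{1}{4} + \dfrac{x^v_i}{2} & \text{if } i \in \{\ell^v_{\min}, \ell^v_{\max}\}, \\[2mm] \dfrac{x^v_i}{2} & \text{otherwise.} \end{cases} \tag{22} \]

Then, for any edge $e = (u, v)$ we have

\[ \Pr[\ell(u) + o_e \le \ell(v)] \ \ge\ \frac{1}{4} \Big( 1 + \frac{1}{\sqrt{2}} \Big) \sum_{i + o_e \le j} y^e_{ij}. \]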
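
A small sketch of the label distribution of Theorem 6 follows. It is illustrative only: the function names and the dict-based representation of the LP marginals are assumptions, and the treatment of a singleton label set (where minimum and maximum coincide and both bonuses are added) is a choice made for this example so that the probabilities still sum to 1.

```python
def offset_rmas_probs(x_v, S_v):
    """Pr[l(v) = i] = x_v[i]/2, plus 1/4 if i = min(S_v) and 1/4 if i = max(S_v)."""
    lo, hi = min(S_v), max(S_v)
    return {
        i: x_v.get(i, 0.0) / 2.0 + 0.25 * (i == lo) + 0.25 * (i == hi)
        for i in S_v
    }

def edge_probability(x_u, S_u, x_v, S_v, offset):
    """Exact Pr[l(u) + o_e <= l(v)] under independent rounding of u and v."""
    pu, pv = offset_rmas_probs(x_u, S_u), offset_rmas_probs(x_v, S_v)
    return sum(pu[i] * pv[j] for i in S_u for j in S_v if i + offset <= j)

# Integral LP solution: u gets label 1, v gets label 3, offset 1, so z_e = 1.
p = edge_probability({1: 1.0}, {1, 2}, {3: 1.0}, {2, 3}, 1)
```

Here the edge is satisfied with probability $0.9375$, comfortably above the guaranteed $\frac{1}{4}(1 + 1/\sqrt{2}) \approx 0.427$.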
















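Proof. Let $S = \bigcup_{v \in V} S_v$ denote the set of all labels and let $z_e = \sum_{i \in S_u,\, j \in S_v,\, i + o_e \le j} y^e_{ij}$ denote the
contribution of the edge $e$ to the LP objective. We have,

\[ \Pr[\ell(u) + o_e \le \ell(v)] = \sum_{\substack{i + o_e \le j \\ i \in S_u,\ j \in S_v}} \Pr[\ell(u) = i] \Pr[\ell(v) = j]. \]

Substituting the assignment probabilities from (22) into the above and simplifying we obtain,

\[ \Pr[\ell(u) + o_e \le \ell(v)] \ \ge\ \frac{1}{16} + \frac{1}{8} \Big( \sum_{\substack{i \in S \\ i \le \ell^v_{\max} - o_e}} x^u_i + \sum_{\substack{j \in S \\ j \ge \ell^u_{\min} + o_e}} x^v_j \Big) + \frac{1}{4} \sum_{\substack{i + o_e \le j \\ i, j \in S}} x^u_i x^v_j. \tag{23} \]

Note that we allow $i, j \in S$ in the above sums instead of $S_u$ and $S_v$. This does not affect the
analysis as the LP forces $x^u_i = 0$ for $i \notin S_u$ and similarly for $v$. Now, consider the $|S| \times |S|$
matrix $[y^e_{ij}]$. Since $x^u_i$ and $x^v_j$ are equal to the row sums and column sums of this matrix
respectively, Lemma 2 guarantees that,

\[ \sum_{\substack{i + o_e \le j \\ i, j \in S}} x^u_i x^v_j \ \ge\ \frac{|S| - o_e + 1}{2(|S| - o_e)} \Big( \sum_{\substack{i \in S_u,\ j \in S_v \\ i + o_e \le j}} y^e_{ij} \Big)^2 = \frac{|S| - o_e + 1}{2(|S| - o_e)}\, z_e^2 \ \ge\ \frac{z_e^2}{2}. \]

We thus have,

\[ \Pr[\ell(u) + o_e \le \ell(v)] \ \ge\ \frac{1}{16} + \frac{1}{8} \Big( \sum_{i \le \ell^v_{\max} - o_e} x^u_i + \sum_{j \ge \ell^u_{\min} + o_e} x^v_j \Big) + \frac{z_e^2}{8} \]

\[ \ge\ \frac{1}{16} + \frac{1}{8} \Big( 2 \sum_{\substack{i + o_e \le j \\ i \in S_u,\ j \in S_v}} y^e_{ij} \Big) + \frac{z_e^2}{8} = \frac{1}{16} + \frac{z_e}{4} + \frac{z_e^2}{8} \]

\[ = \frac{z_e}{4} + \frac{z_e}{8} \Big( z_e + \frac{1}{2 z_e} \Big) \ \ge\ \frac{z_e}{4} \Big( 1 + \frac{1}{\sqrt{2}} \Big), \tag{24} \]

where the last inequality uses $t + 1/(at) \ge 2/\sqrt{a}$ for $a, t > 0$. □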


5 Sherali-Adams Integrality Gap for Max-$k$-Ordering

For convenience, in the construction of the integrality gaps presented in this section the 
integral optimum and the LP objective shall count the weighted fraction of edges satisfied. 
We begin with a simple construction of an $n$-vertex digraph which is a $(2 - 2/n)$ integrality
gap for the standard LP relaxation for Max-$k$-Ordering in Figure 1 for $2 \le k \le n$.

Claim 1. Let $D = (V, A)$ be the complete digraph on $n$ vertices, i.e. having a directed edge
for every ordered pair $(u, v)$ of distinct vertices $u$ and $v$. Thus, $|A| = 2\binom{n}{2}$. Let $k \in \{2, \dots, n\}$.
Then,

— The optimum of Max-$k$-Ordering on $D$ is at most $\frac{n}{2(n-1)} \left( 1 - \frac{1}{k} \right)$.

— There is a solution to the standard LP relaxation for Max-$k$-Ordering on $D$ with
value $\left( 1 - \frac{1}{k} \right)$.

In particular, the above implies a $(2 - 2/n)$ integrality gap for the LP relaxation in Figure 1.

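Proof. The number of forward edges is simply the number of pairs of vertices $\{u, v\}$
with distinct labels, since exactly one of $(u, v)$ and $(v, u)$ goes forward for each such pair. By Turán's Theorem, the optimal integral solution is to partition the
vertices into $k$ subsets whose sizes differ by at most 1, giving each subset a distinct label
from $\{1, \dots, k\}$. This implies that there are at most $\frac{n^2}{2} \left( 1 - \frac{1}{k} \right)$ forward edges. Hence, the
optimal integral solution has value at most

\[ \frac{1}{n(n-1)} \cdot \frac{n^2}{2} \Big( 1 - \frac{1}{k} \Big) = \frac{n}{2(n-1)} \Big( 1 - \frac{1}{k} \Big). \]

On the other hand, consider an LP solution that assigns $x^v_i = \frac{1}{k}$ for all $v \in V$ and $i \in [k]$,
and $y^e_{i,i+1} = \frac{1}{k}$ for all $e = (u, v) \in A$ and $i \in [k - 1]$ (with $y^e_{k,1} = \frac{1}{k}$, so that the marginal constraints hold). Each edge $e$ contributes

\[ \sum_{i=1}^{k-1} y^e_{i,i+1} = 1 - \frac{1}{k} \]

to the objective.

□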
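
The two quantities in Claim 1 can be computed exactly for small instances. The sketch below is an illustration with hypothetical function names; it evaluates the balanced-partition optimum as a fraction of $|A| = n(n-1)$ and compares it with the LP value $1 - 1/k$.

```python
from fractions import Fraction

def balanced_forward_edges(n, k):
    """Forward edges of the complete digraph under a balanced k-partition:
    one per unordered pair of vertices that receive distinct labels."""
    q, r = divmod(n, k)
    sizes = [q + 1] * r + [q] * (k - r)
    same_label_pairs = sum(s * (s - 1) // 2 for s in sizes)
    return n * (n - 1) // 2 - same_label_pairs

def integrality_gap(n, k):
    """Ratio of the LP value (1 - 1/k) to the integral optimum fraction."""
    integral = Fraction(balanced_forward_edges(n, k), n * (n - 1))
    lp_value = 1 - Fraction(1, k)
    return lp_value / integral

gap_small = integrality_gap(4, 2)
gap_even = integrality_gap(6, 3)
```

When $k$ divides $n$ the ratio equals $2 - 2/n$ exactly, e.g. $3/2$ for $n = 4, k = 2$ and $5/3$ for $n = 6, k = 3$.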


The above integrality gap is essentially retained even after near polynomial rounds of the
Sherali-Adams constraints given in Figure 2. In particular, we prove the following that
implies Theorem 3.

Theorem 7. For any constant $\varepsilon > 0$, there is $\gamma > 0$ such that for large enough $n \in \mathbb{N}$ and
any $k \in \{2, \dots, n\}$, there is a weighted digraph $D^* = (V^*, A^*)$ satisfying,

— The optimum of Max-$k$-Ordering on $D^*$ is at most $\frac{1}{2} \left( 1 - \frac{1}{k} \right) + \varepsilon$.

— The LP relaxation for Max-$k$-Ordering augmented with $n^{\gamma/\log\log k}$ rounds of Sherali-Adams
constraints has objective value at least $(1 - \varepsilon) \left( 1 - \frac{1}{k} \right)$.

The rest of this section is devoted to proving the above theorem. Our construction of the
integrality gap uses the techniques of Lee, who proved a similar gap for a variant of the
graph pricing problem. We begin by showing that a sparse, random subgraph $D'$ of the
complete digraph $D$ mentioned above also has a low optimum solution. For this, we require
the following result on $\varepsilon$-samples [20] for finite set systems that follows from Hoeffding's
bound. The reader is referred to Theorem 3.2 in [2] for a proof.
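
Theorem 8. Let $(U, \mathcal{S})$ denote a finite set system. Suppose $\bar{A}$ is a multi-set obtained by
sampling from $U$ independently and uniformly $m$ times where $m \ge \frac{1}{2\varepsilon^2} \ln \frac{2|\mathcal{S}|}{\delta}$. Then with
probability at least $1 - \delta$,

\[ \left| \frac{|\bar{A} \cap S'|}{|\bar{A}|} - \frac{|S'|}{|U|} \right| \le \varepsilon, \quad \forall S' \in \mathcal{S}. \]

$\bar{A}$ is referred to as an $\varepsilon$-sample for $(U, \mathcal{S})$.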


In order to construct a solution that satisfies Sherali-Adams constraints for a large number
of rounds, we require the instance to be locally sparse, i.e. the underlying undirected graph
is almost a tree on subgraphs induced by large (but bounded) vertex sets. We use the notion
of path decomposability as defined by Charikar et al. [3] as a measure of local sparsity.

Definition 4. [Path Decomposability] A graph $G$ is $l$-path decomposable if every 2-connected
subgraph $H$ of $G$ contains a path of length $l$ such that every vertex of the path has degree 2
in $H$.


We proceed to show that the sparse graph $D'$ obtained as above can be further processed so
that it is locally sparse. Applying the techniques of [3] and of Lee yields a solution with high
value that satisfies the Sherali-Adams constraints.


5.1 Constructing a Sparse Instance 

Lemma 3. Let $D = (V, A)$ be the complete digraph on $n$ vertices, let $k \in \{2, \dots, n\}$ and
$\varepsilon > 0$ be a small constant. The weighted digraph $D' = (V, A')$ obtained by sampling $O(n \log k / \varepsilon^2)$
edges uniformly at random satisfies $\mathrm{Opt}(D') \le \frac{1}{2} \left( 1 - \frac{1}{k} \right) + \varepsilon$ with high probability, where
Opt denotes the optimum of Max-$k$-Ordering.


Proof. Let $V = [n]$ and $A \subseteq [n] \times [n]$ denote the vertices and edges of the digraph $D$. Let
$\rho : [n] \to [k]$ denote some labeling of the vertices. Let $S_\rho := \{(i, j) \mid \rho(i) < \rho(j),\ (i, j) \in A\}$
denote the subset of edges that are satisfied by $\rho$, and $\mathcal{S} := \{S_\rho \mid \forall \text{ labelings } \rho\}$ denote
the collection of such subsets induced by all feasible labelings. Since the number of distinct
labelings is $k^n$, we have that $|\mathcal{S}| \le k^n$.

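We now construct an $(\varepsilon/2)$-sample for the set system $(A, \mathcal{S})$ by randomly sampling edges
(with replacement) as per Theorem 8. Let $\bar{A}$ denote the bag of randomly chosen
edges, of the size required by Theorem 8. For a suitably small $\delta$, we get that $|\bar{A}| = O(n \log k / \varepsilon^2)$ and with probability at least $1 - \delta$
we have,

\[ \left| \frac{|S_\rho \cap \bar{A}|}{|\bar{A}|} - \frac{|S_\rho|}{|A|} \right| \le \varepsilon/2, \quad \forall \rho. \tag{25} \]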


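In order to avoid multi-edges in the construction, we define the weight of an edge $w(u, v)$
to be the number of times that edge is sampled in $\bar{A}$ and let $A'$ denote the set of thus
weighted edges obtained from $\bar{A}$. Equation (25) along with Claim 1 guarantees that, for large enough $n$, the
optimum integral solution of the weighted graph $D'$ induced by the edges $A'$ is bounded by
$\mathrm{Opt}(D') \le \mathrm{Opt}(D) + \frac{\varepsilon}{2} \le \frac{n}{2(n-1)} \left( 1 - \frac{1}{k} \right) + \frac{\varepsilon}{2} \le \frac{1}{2} \left( 1 - \frac{1}{k} \right) + \varepsilon$, as desired. □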
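
The sampling step of this proof can be sketched as follows. This is only an illustration: the function names are hypothetical and the sample size is passed in directly rather than derived from Theorem 8.

```python
import random
from collections import Counter

def sample_weighted_subgraph(n, m, rng):
    """Sample m edges of the complete digraph on {0, ..., n-1} uniformly with
    replacement; the weight of an edge is the number of times it was drawn."""
    edges = [(u, v) for u in range(n) for v in range(n) if u != v]
    return Counter(rng.choice(edges) for _ in range(m))

def forward_fraction(weights, labeling):
    """Weighted fraction of sampled edges (u, v) with labeling[u] < labeling[v]."""
    total = sum(weights.values())
    forward = sum(w for (u, v), w in weights.items() if labeling[u] < labeling[v])
    return forward / total

weights = sample_weighted_subgraph(5, 50, random.Random(0))
frac = forward_fraction(weights, {0: 1, 1: 1, 2: 2, 3: 2, 4: 3})
```

By the $\varepsilon$-sample guarantee, for a large enough sample the value `forward_fraction` of every labeling on the sampled graph stays within $\varepsilon/2$ of its value on the complete digraph.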

Footnote. A set system $(U, \mathcal{S})$ consists of a ground set $U$ and a collection of its subsets $\mathcal{S} \subseteq 2^U$. It is
called finite if $|U|$ is finite.

Given a digraph, let its corresponding undirected multigraph be obtained by replacing every
directed edge by the corresponding undirected one. Note that if the digraph contains both
the $(u, v)$ and $(v, u)$ edges for some pair of vertices, then the undirected multigraph contains two
parallel edges between $u$ and $v$.

We now show that $D'$ obtained in Lemma 3 can be slightly modified so that its corresponding
underlying multigraph is almost regular, has high girth, and is locally sparse, i.e. all
small enough subgraphs are $l$-path decomposable for an appropriate choice of parameters.

Lemma 4. Let $k \in \{2, \dots, n\}$ and $\varepsilon > 0$ be a small enough constant. Given the complete
digraph $D = (V, A)$ on $n$ vertices, let $D' = (V, A')$ be obtained by sampling (with replacement)
$\Theta(n \log k / \varepsilon^2)$ edges uniformly at random. Then, with high probability there exists a subgraph
$D'' = (V, A'')$ of $D'$ obtained by removing at most $\varepsilon |A'|$ edges, such that the undirected
multigraph $G''$ underlying $D''$ satisfies the following properties:

1. Bounded Degree: The maximum degree of any vertex is at most $2\Delta$ and $G''$ has $\Theta(\Delta n)$
edges, where $\Delta = \Theta(\log k / \varepsilon^2)$.

2. High Girth: $G''$ has girth at least $l = \Theta(\log n / \log \Delta)$.

Proof. Since $D'$ is obtained by sampling $\Theta(n \log k / \varepsilon^2)$ edges uniformly, the probability that any
given edge is selected is at most $p = \Theta(\log k / (n \varepsilon^2))$. In addition, these events are negatively correlated.
Therefore given any set of edges $S$, the probability that all the edges in $S$ are sampled is
upper bounded by $p^{|S|}$.

Bounded Degree: As the maximum degree of any vertex in $D$ is at most $2n$, the expected
degree of any vertex $v \in V$ in $D'$ is at most $\Delta = 2pn = \Theta(\log k / \varepsilon^2)$. Call a vertex $v \in V$ bad if
it has degree more than $2\Delta$ in $D'$, and call an edge $(u, v) \in A'$ bad if either $u$ or $v$ is bad.
Now, for any edge $(u, v)$, the probability that $(u, v)$ is bad given that $(u, v) \in A'$ is at most
$2e^{-\Delta/3}$ by the Chernoff bound. Hence, the expected number of bad edges is at most $2e^{-\Delta/3} |A'|$.
Finally, by Markov's inequality, with probability at least $\frac{1}{2}$ the number of bad edges is at
most $4e^{-\Delta/3} |A'|$. Deleting all bad edges guarantees that the maximum degree of $D'$ is at most
$2\Delta$, and with probability at least half we only delete $4e^{-\Delta/3} |A'|$ edges, which is much smaller
than $\varepsilon |A'|$ since $\Delta = \Theta(\log k / \varepsilon^2)$.

Girth Control: Let $G'$ denote the undirected multigraph underlying $D'$. Since the degree
of any vertex in $D$ is at most $2n$, we have

\[ \mathbb{E}[\text{Number of cycles in } G' \text{ of length } i] \le n (2n)^{i-1} p^i \le (C\Delta)^i \]

for some constant $C$. For $i = O(\log n / \log \Delta)$ with a small enough hidden constant, this expectation is
polynomially smaller than $n$. Summing over all $i$ in $2, \dots, l = O(\log n / \log \Delta)$, we get that the
expected number of cycles of length up to $l$ is sublinear in $n$, and hence the number of such cycles is $o(n)$ with high probability. We
can then remove one edge from each such cycle (i.e. $o(n)$ edges) to ensure that the graph
$G''$ so obtained has girth at least $l$. In particular, note that the corresponding digraph $D''$
has no 2-cycles. □
Ensuring Local Sparsity. Using Lemma 5, we ensure that the subgraph of $G''$ induced by any subset of at most $n^{\delta}$ vertices is $l$-path decomposable for some constant $\delta > 0$. The following lemma shows that 2-connected subgraphs of $G''$ of polynomially bounded size are sparse.

Lemma 5. The undirected multigraph $G''$ underlying the digraph $D''$ satisfies the following property: there exists $\delta > 0$ such that every 2-connected subgraph of $G''$ containing $t' \le n^{\delta}$ vertices has at most $(1+\eta)t'$ edges, where $\eta = \frac{1}{3l}$.

Proof. Let $G$ denote the undirected multigraph underlying the graph $D$ that was used as a starting point for Lemma 4. The proof proceeds by counting the number of possible "dense" subgraphs of $G$ and showing that the probability that any of them exists in $G''$ after the previous sparsification steps is bounded by $o(1)$. We consider two cases based on the value of $t'$.

Case 1: $4 \le t' \le 1/\eta$. We first bound the total number of 2-connected subgraphs of $G$ with $t'$ vertices and $t'+1$ edges. It is easy to verify that the only possible degree sequences for such subgraphs are $(4,2,2,\ldots)$ or $(3,3,2,2,\ldots)$. Suppose it is $(4,2,2,\ldots)$ and let $v$ be the vertex with degree 4. Then there must be a sequence of $t'+2$ vertices $(v,\ldots,v,\ldots,v)$ that represents an Eulerian tour. The number of such sequences is upper bounded by $n \cdot t' \cdot n^{t'-1} = t' n^{t'}$ ($n$ for guessing $v$, $t'$ for guessing the position of $v$ in the middle, and $n^{t'-1}$ to guess the other $t'-1$ vertices). Now assume that the degree sequence is $(3,3,2,2,\ldots)$ and let $u, v$ be the vertices with degree 3. Then there must be a sequence of $t'+2$ vertices $(u,\ldots,v,\ldots,u,\ldots,v)$ that represents an Eulerian path from $u$ to $v$. By a similar argument, the number of such sequences is bounded by $t'^2 n^{t'}$. Hence, the total number of 2-connected subgraphs of $G$ with $t'$ vertices and $t'+1$ edges is at most $2t'^2 n^{t'}$.

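Therefore, the probability that there exists a subgraph of $G''$ with $t'$ vertices and $t'+1$ edges for $4 \le t' \le 1/\eta = 3l$ is at most

$$\sum_{t'=4}^{3l} 2t'^2 n^{t'} p^{t'+1} = \sum_{t'=4}^{3l} \frac{2t'^2}{n}(np)^{t'+1} \le \frac{l^3}{n}(C\Delta)^{3l+1}, \qquad (26)$$

where $C$ is an appropriate constant. For $l = O(\log n/\log \Delta)$ with a small enough hidden constant, this quantity is $o(1)$.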

Case 2: $n^{\delta} \ge t' > 1/\eta = 3l$. In this case, we count the number of subgraphs of $G$ with $t'$ vertices and $(1+\eta)t'$ edges. As shown by Lee [13], the number of such subgraphs is bounded
by Ga* n* (^)^''* for some constants C and a. 

Therefore, the probability that such a subgraph exists in G" is at most 


< (CiA^)*' { g 2 


j./\ 2t7t' 


C+v)t' 




(27) 


where $C_1$ and $C_2$ are appropriate constants. We choose $l = \Theta(\log n/\log \Delta)$ and $\delta \in (0, 0.1)$ such that the above quantity is sufficiently small. Summing over all $t' = 3l, \ldots, n^{\delta}$, the probability that such a subgraph exists is still bounded by $o(1)$. □
Finally, we need the following lemma proved by Arora et al. [1] regarding the existence of long paths in sparse, 2-connected graphs.

Lemma 6 (Arora et al. [1]). Let $l \ge 1$ be an integer and $0 < \eta < \frac{1}{3l-1}$, and let $H$ be a 2-connected graph with $t$ vertices and at most $(1+\eta)t$ edges that is not a cycle. Then $H$ contains a path of length at least $l+1$ whose internal vertices have degree 2 in $H$.

Corollary 1. Every subgraph of $G''$ that is induced on at most $n^{\delta}$ vertices is $(l-1)$-path decomposable.

Proof. Consider any 2-connected subgraph $H$ of such an induced subgraph. If $H$ is not a cycle, then Lemma 5 and Lemma 6 together guarantee that $H$ contains a path of length at least $l+1$ such that all internal vertices have degree 2 in $H$, which gives us a path of length $(l-1)$ with all vertices of degree 2 in $H$. On the other hand, if $H$ is a cycle, then Lemma 4 guarantees that $H$ has at least $l$ vertices and hence again the required path exists. □

For convenience we replace $(l-1)$ in Corollary 1 with $l$, and since $l = \Theta(\log n/\log \Delta)$, this does not change any parameter noticeably.


Final Instance 

Theorem 9. Given $k \in \{2, \ldots, n\}$ and constants $\varepsilon, \mu > 0$, there exists a constant $\gamma > 0$, and parameters $\Delta$ and $l = \Theta(\log n/\log \Delta)$ such that there is an instance $D$ (with underlying undirected graph $G$) of Max-k-Ordering with the following properties:

— Low Integral Optimum: Opt{D) < ^(1 — e. 

— Almost Regularity: The maximum degree of $G$ is at most $2\Delta$, and $G$ has $\Omega(\Delta n)$ edges.

— Local Sparsity: For $t \le n^{\gamma}$, every induced subgraph of $G$ on $(2\Delta)^{l} t$ vertices is $l$-path decomposable.

— Large Noise: For t < (1 — 

Note that $n^{\gamma} = n^{\Omega(1/\log\log k)}$.

Proof. Let $D''$ be the digraph obtained from Lemma 4. Lemmas 3 and 4 imply that the digraph $D''$ so obtained (i) has low integral optimum, (ii) is almost regular, and (iii) has girth at least $l$.

The large noise condition is satisfied by $l \ge C\gamma \log n/\log \Delta$ for an appropriate constant $C$.

Corollary 1 guarantees that the local sparsity condition is satisfied if $(2\Delta)^{l} t \le n^{\delta}$, i.e. $l \le C'(\delta - \gamma)\log n/\log \Delta$ for another constant $C'$. Hence, by selecting a small enough constant $\gamma$ and an appropriate $l = \Theta(\log n/\log \Delta)$, the instance $D''$ obtained in Lemma 4 satisfies all the required properties. □


5.2 Constructing Local Distributions 

Let $D = (V, A)$ be the instance of Max-k-Ordering constructed in Theorem 9 and let $G = (V, E)$ be the underlying undirected graph. We now show that there exists a solution to the LP after $t = n^{\gamma}$ rounds of the Sherali-Adams hierarchy whose objective is at least $(1-\varepsilon)(1-\frac{1}{k})$. Our proof for the existence of such a solution essentially follows the approach of Lee [13]. Given a set of $t \le n^{\gamma}$ vertices $S = \{v_1, v_2, \ldots, v_t\}$, our goal is to give a distribution on events $\{\ell(v_1) = x_1, \ell(v_2) = x_2, \ldots, \ell(v_t) = x_t\}_{x_1, x_2, \ldots, x_t \in [k]}$.

Let $d(u,v)$ be the shortest distance between $u$ and $v$ in the (undirected) graph $G$. Let $V' \subseteq V$ be the set of vertices that are at most $l$ distance away from $S$ and let $G'$ be the subgraph induced by $V'$ on $G$. Since the maximum degree of vertices is bounded by $2\Delta$, we have $|V'| \le (2\Delta)^{l} t$ and hence $G'$ is $l$-path decomposable by Theorem 9.

The first step of the construction relies on the following theorem by Charikar et al. [3] that shows that if a graph $G'$ is $l$-path decomposable, then there exists a distribution on partitions of $V'$ such that close vertices are likely to remain in the same part while distant vertices are likely to be separated.


Theorem 10 (Charikar et al. [4]). Suppose $G' = (V', E')$ is an $l$-path decomposable graph. Let $d(\cdot,\cdot)$ be the shortest path distance on $G'$, and $L = \lfloor l/9 \rfloor$; $\mu \in [1/L, 1]$. Then there exists a probabilistic distribution of multicuts of $G'$ (or in other words random partition of $G'$ into pieces) such that the following properties hold. For every two vertices $u$ and $v$,

1. If $d(u,v) \le L$, then the probability that $u$ and $v$ are separated by the multicut (i.e. lie in different parts) equals $1 - (1-\mu)^{d(u,v)}$; moreover, if $u$ and $v$ lie in the same part, then the unique shortest path between $u$ and $v$ also lies in that part.

2. If $d(u,v) > L$, then the probability that $u$ and $v$ are separated by the multicut is at least $1 - (1-\mu)^{L}$.

3. Every piece of the multicut partition is a tree.


Based on this random partitioning, we define a distribution on the vertices in $S$ (actually in $V'$). As each piece of the above partition is a tree, given some vertex $u$ with an arbitrary label $i$, we can extend it to a labeling $\ell$ for every other vertex in that piece such that every directed edge $(x,y)$ in the piece satisfies $\ell(y) - \ell(x) \equiv 1 \pmod{k}$.
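
The extension of a single label to a whole tree piece can be sketched in a few lines of Python. This is a minimal illustration, not the construction's own code: the function name `propagate_labels` is hypothetical, and labels are taken from {0, ..., k-1} rather than [k] = {1, ..., k} purely for convenience with modular arithmetic.

```python
from collections import deque


def propagate_labels(arcs, root, root_label, k):
    """Extend root_label at `root` to every vertex of a tree piece.

    arcs are directed pairs (x, y) whose underlying undirected graph is a
    tree. After propagation every arc (x, y) satisfies
    (label[y] - label[x]) % k == 1.
    """
    adjacency = {}
    for x, y in arcs:
        adjacency.setdefault(x, []).append((y, 1))    # along the arc: +1
        adjacency.setdefault(y, []).append((x, -1))   # against the arc: -1
    label = {root: root_label % k}
    queue = deque([root])
    while queue:
        x = queue.popleft()
        for y, step in adjacency.get(x, []):
            if y not in label:
                label[y] = (label[x] + step) % k
                queue.append(y)
    return label


# A small tree piece with mixed arc directions.
tree_arcs = [(0, 1), (2, 1), (2, 3)]
labels = propagate_labels(tree_arcs, root=0, root_label=0, k=3)
```

Because the piece is a tree, each vertex is reached along exactly one path, so the propagated label is well defined; on a graph with cycles the same procedure could produce an arc that violates the constraint.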

For vertices $u$ and $v$ with $d(u,v) \le L$, we say that label $i$ for $u$ and $i'$ for $v$ match if the labeling $\ell(u) = i, \ell(v) = i'$ can be extended so that for every directed edge $(x,y)$ on the unique shortest path between $u$ and $v$, $\ell(y) - \ell(x) \equiv 1 \pmod{k}$. Note that there are exactly $k$ such matching pairs for every $u$ and $v$. We can now use Theorem 10 to obtain a random labeling as follows.

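Corollary 2. Suppose $G' = (V', E')$ is an $l$-path decomposable graph. Let $L = \lfloor l/9 \rfloor$; $\mu \in [1/L, 1]$. Then there exists a random labeling $r : V' \to [k]$ such that

1. If $d = d(u,v) \le L$, then

$$\Pr[r(u) = i, r(v) = i'] = \begin{cases} \frac{(1-\mu)^{d}}{k} + \frac{1-(1-\mu)^{d}}{k^{2}} & \text{if } i \text{ and } i' \text{ match,} \\ \frac{1-(1-\mu)^{d}}{k^{2}} & \text{otherwise.} \end{cases}$$

2. If $d > L$, then

$$\frac{1-(1-\mu)^{L}}{k^{2}} \le \Pr[r(u) = i, r(v) = i'] \le \frac{1}{k^{2}} + \frac{(1-\mu)^{L}}{k} \quad \text{for any } i, i' \in [k].$$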


Proof. We first sample from the distribution of multicuts given by Theorem 10. For every piece obtained, we pick an arbitrary vertex $u$ and assign $r(u)$ to be a uniformly random label from $[k]$. Now, since each piece is a tree, we can propagate this label along the tree so that for every directed edge $(v,w)$ we have $r(w) - r(v) \equiv 1 \pmod{k}$. Note that the final distribution obtained does not depend on the choice of the initial vertex $u$.

Consider any two vertices $u$ and $v$. If $d(u,v) \le L$ and $u$ and $v$ are in the same piece, then the path connecting $u$ and $v$ in the piece is the shortest path. If $i$ and $i'$ are matching labels, then

$$\Pr[r(u) = i, r(v) = i'] = \Pr[u, v \text{ in the same piece}] \cdot \frac{1}{k} + \Pr[u, v \text{ are separated}] \cdot \frac{1}{k^{2}}.$$

On the other hand, if $i$ and $i'$ are not matching,

$$\Pr[r(u) = i, r(v) = i'] = \Pr[u, v \text{ in the same piece}] \cdot 0 + \Pr[u, v \text{ are separated}] \cdot \frac{1}{k^{2}}.$$

Similarly, if $d(u,v) > L$, then $\Pr[r(u) = i, r(v) = i']$ is lower bounded by $\Pr[u, v \text{ are separated}]/k^{2}$ and upper bounded by $\Pr[u, v \text{ in the same piece}]/k + \Pr[u, v \text{ are separated}]/k^{2}$. Substituting the separation probabilities in Theorem 10 proves the desired result. □
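
As a sanity check on the case $d(u,v) \le L$, the pair probabilities can be evaluated exactly and summed over all $k^2$ label pairs. The sketch below is illustrative only: the function name `pair_probability` is hypothetical, and the formula used is the one read off from the proof above (same-piece probability $(1-\mu)^d$, uniform labels on separated pieces).

```python
from fractions import Fraction


def pair_probability(d, L, mu, k, match):
    """Pr[r(u) = i, r(v) = i'] for two vertices at distance d <= L.

    mu    : separation parameter of the random multicut
    match : whether the label pair (i, i') is a matching pair
    """
    if d > L:
        raise ValueError("formula only applies when d <= L")
    same_piece = (1 - mu) ** d      # u and v end up in the same tree piece
    separated = 1 - same_piece      # u and v are cut apart
    probability = separated / k**2  # independent uniform labels
    if match:
        probability += same_piece / k
    return probability


k, d, L, mu = 4, 2, 5, Fraction(1, 3)
p_match = pair_probability(d, L, mu, k, match=True)
p_other = pair_probability(d, L, mu, k, match=False)
# k matching pairs and k*k - k non-matching pairs exhaust all outcomes.
total = k * p_match + (k * k - k) * p_other
```

Using `Fraction` keeps the arithmetic exact, so the total can be compared with 1 without a floating-point tolerance.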

The above random labeling defines a distribution $\nu_S$ over labels of pairs of vertices as follows.

Definition 5. Let $S = \{v_1, v_2, \ldots, v_t\}$ be a fixed set of vertices. For any two vertices $u, v \in S$ and $i, i' \in [k]$, let $\nu_S(u(i), v(i')) = \Pr[x(u) = i, x(v) = i']$ in the local distribution on $S$ defined by $r$ in Corollary 2.

We now define another distribution $\rho$ over labels for pairs of vertices that is independent of the choice of the set $S$ as follows.

Definition 6. For any vertices $u \ne v$ and $i, i' \in [k]$, let $\rho(u(i), v(i')) = \Pr[x(u) = i, x(v) = i']$ if $d(u,v) \le L$, and $\frac{1}{k^{2}}$ otherwise. Also define $\rho(u(i), u(i)) = \frac{1}{k}$ and $\rho(u(i), u(i')) = 0$ for $i \ne i'$. Since the shortest path between $u$ and $v$ is unique when $d(u,v) \le L$, $\rho$ is uniquely defined by $D$ and $G$ and is independent of the choice of set $S$.

Lee [13] shows that it is possible to use the $\rho$ and $\nu_S$ distributions defined above to produce consistent distributions over events of the form $\{\ell(v_1) = x_1, \ldots, \ell(v_t) = x_t\}_{x_1, \ldots, x_t \in [k]}$. Further, these distributions need to be consistent, i.e., the marginal distribution on $S \cap S'$ does not depend on the choice of its superset ($S$ or $S'$) that is used to obtain the larger local distribution. The key idea here, as shown by Charikar et al. [3], is to embed $\rho$ into Euclidean space with a small error to obtain $tk$ vectors such that $u(i) \cdot v(i') \approx \rho(u(i), v(i'))$. This uses the large noise property in Theorem 9. The following lemma appears as Lemma 5.7 in [13].

Lemma 7 (Lee |13| L There exist tk vectors {r’(i)}„g 5 such that ||n(f)||| = p, + 
and u{i) ■ v{i') = f + p{u{i),v{i')). 

Given such $tk$ vectors, one can use a geometric rounding scheme to define the consistent local distributions. Note that the local distribution is completely defined by the pairwise inner products of the vectors which, for any two vectors, is independent of the subset $S$. Lee [13] shows that the following simple rounding scheme suffices to obtain a good distribution: choose a random Gaussian vector $g$, and for each vertex $v$, let $\ell(v) = \arg\max_{i}(v(i) \cdot g)$.
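
The rounding rule can be sketched as follows. This is a minimal illustration under stated assumptions: the function name `gaussian_round`, the dictionary layout of the vectors, and the toy vectors are all hypothetical, and labels are indexed from 0 rather than 1. It does not construct the vectors of Lemma 7; it only applies the argmax rule to vectors that are given.

```python
import random


def gaussian_round(vectors, rng):
    """Label each vertex by the index of its vector most aligned with g.

    vectors : dict mapping a vertex to a list of k vectors (lists of floats);
              vectors[v][i] plays the role of v(i)
    rng     : a random.Random instance, used to draw one shared Gaussian g
    """
    dimension = len(next(iter(vectors.values()))[0])
    g = [rng.gauss(0.0, 1.0) for _ in range(dimension)]
    labeling = {}
    for v, vecs in vectors.items():
        scores = [sum(a * b for a, b in zip(vec, g)) for vec in vecs]
        labeling[v] = max(range(len(vecs)), key=scores.__getitem__)
    return labeling


# Toy input with k = 2: both vertices carry the same pair of vectors, so the
# shared Gaussian always assigns them the same label.
toy_vectors = {"u": [[1.0, 0.0], [0.0, 1.0]],
               "v": [[1.0, 0.0], [0.0, 1.0]]}
rng = random.Random(0)
samples = [gaussian_round(toy_vectors, rng) for _ in range(20)]
```

The same Gaussian vector is used for every vertex in one sample, which is what makes the joint distribution of two labels depend only on the inner products of their vectors.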

Lemma 8 (Lee [13]). There exists a $\mu > 0$ depending on $k$ and $\varepsilon$ such that, in the above rounding scheme, for any edge $(u,v)$ and any label $i \in [k]$ the probability that $\ell(u) = i$ and $\ell(v) = i+1 \pmod{k}$ is at least $\frac{1-12\varepsilon}{k}$.

The lemma follows from the proof of Lemma 5.8 of Lee [13] by substituting Ia(u, v) = 1.
Consider the solution to $t$ rounds of the Sherali-Adams hierarchy obtained by the above rounding process. For any edge $(u,v) \in A$, its contribution to the objective is

$$\sum_{i < i'} \Pr[\ell(u) = i, \ell(v) = i'] \ge \sum_{i \in [k-1]} \Pr[\ell(u) = i, \ell(v) = i+1] \ge \sum_{i \in [k-1]} \frac{1-12\varepsilon}{k} = (1-12\varepsilon)\left(1-\frac{1}{k}\right).$$

The last inequality follows due to Lemma 8. Thus we have a fractional solution with value at least $(1-12\varepsilon)(1-\frac{1}{k})$. This, along with the low optimum of the instance from Theorem 9, completes the proof of Theorem 1.


6 The DED(k) Problem

Recall that the DED(k) problem is to remove the minimum weight subset of edges from a given DAG so that the remaining digraph does not contain any path of length k.


6.1 Combinatorial k-Approximation

In the unweighted case (i.e. all edges have unit weight), the following simple scheme is a k-approximation algorithm. As long as the DAG contains a directed path P of length k, delete all edges of that path. The scheme guarantees a k-approximation because the optimal solution must delete at least one edge from each such path P, while the algorithm deletes exactly k edges; the paths chosen in different iterations are edge-disjoint, so these charges do not overlap.

The following slightly modified scheme that uses the local ratio technique yields a k-approximation for weighted DAGs.

Algorithm LocalRatio:

1. $S \leftarrow \{e \in E \mid w(e) = 0\}$

2. While $(V, E \setminus S)$ contains a path $P$ of length $k$

(a) $w_{\min} \leftarrow \min_{e \in P} w(e)$

(b) $w(e) \leftarrow w(e) - w_{\min}$, $\forall e \in P$

(c) $S \leftarrow \{e \in E \mid w(e) = 0\}$
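
The steps above can be sketched in Python. This is a minimal sketch under stated assumptions, not a reference implementation: the function names are hypothetical, vertices are assumed to be numbered 0, ..., n-1 in a topological order (so u < v for every edge (u, v)), and the path search is a plain longest-path dynamic program.

```python
def find_path_of_length(n, edges, k):
    """Return a list of k consecutive edges forming a directed path, or None.

    Assumes u < v for every edge (u, v), i.e. the vertex numbering is a
    topological order of the DAG.
    """
    longest = [0] * n    # longest[v]: max number of edges on a path ending at v
    last = [None] * n    # last[v]: final edge of one such longest path
    for u, v in sorted(edges, key=lambda e: e[1]):
        if longest[u] + 1 > longest[v]:
            longest[v] = longest[u] + 1
            last[v] = (u, v)
    for v in range(n):
        if longest[v] >= k:
            path = []
            while len(path) < k:       # walk back exactly k edges
                edge = last[v]
                path.append(edge)
                v = edge[0]
            return path[::-1]
    return None


def local_ratio(n, weights, k):
    """Return the set S of deleted edges; weights maps (u, v) to a weight >= 0."""
    w = dict(weights)
    deleted = {e for e, we in w.items() if we == 0}
    while True:
        path = find_path_of_length(n, [e for e in w if e not in deleted], k)
        if path is None:
            return deleted
        w_min = min(w[e] for e in path)
        for e in path:
            w[e] -= w_min
        deleted = {e for e, we in w.items() if we == 0}


# Path 0 -> 1 -> 2 -> 3 with k = 2: deleting the middle edge suffices.
weights = {(0, 1): 3, (1, 2): 1, (2, 3): 2}
solution = local_ratio(4, weights, k=2)
```

On this toy instance the returned set has weight 1, which is also the optimum; in general the guarantee is only a factor of k.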

Theorem 11. LocalRatio is a polynomial time k-approximation to the DED(k) problem on weighted DAGs.

Proof. We note that LocalRatio terminates in at most $|E|$ iterations, as the weight of at least one edge reduces to 0 in each iteration. Also, since one can check whether there exists a path of length $k$ in a DAG via dynamic programming, it follows that LocalRatio runs in polynomial time.

Let $O \subseteq E$ be an optimal solution and $S \subseteq E$ be the solution returned by LocalRatio. Note that an edge is in $S$ if its weight is reduced to 0 in some iteration of the algorithm. Thus, the weight of $S$ is upper bounded by the total reduction in the weight of the edges. At each iteration, for a path $P$ of length $k$, the reduction is at most $k$ times the weight of the minimum weight edge (according to the current weights) in $P$. Since there is at least one edge $e$ in $P$ which is in $O$, we charge this reduction to the weight of $e$. Then the weight of $e$ decreases by at least a $1/k$ factor of what is charged to it, and it cannot decrease beyond 0. Thus, the weight of $S$ is at most $k$ times the weight of $O$. □
6.2 k-Approximation via LP Rounding

The natural LP relaxation for DED(k) on an n-vertex DAG $D = (V, E)$ is given in Figure 3. This relaxation has $n^{O(k)}$ constraints. However, when the input graph is a DAG, it admits the following polynomial time separation oracle for any $k \in \{2, \ldots, n-1\}$.

$$\min \sum_{e \in E} w(e)\, x_e \qquad (28)$$

$$\text{subject to} \quad \sum_{e \in P} x_e \ge 1 \quad \forall \text{ paths } P \text{ of length } k, \qquad (29)$$

$$x_e \ge 0 \quad \forall e \in E. \qquad (30)$$

Fig. 3. LP Relaxation for an instance of DED(k).


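Separation Oracle and Rounding. For each vertex $v \in V$ and integer $t \in [n]$, define $a_t^v = \min_P \left(\sum_{e \in P} x_e\right)$ where $P$ ranges over paths of length $t$ that end at vertex $v$. Once we compute all these values, a constraint is violated if and only if there is a vertex $v$ such that $a_k^v < 1$.

On a DAG the values $\{a_t^v \mid v \in V, t \in [n]\}$ can be computed by dynamic programming. First assume that the vertices are arranged in a topological order. For any vertex $v$ with no predecessors, no path of positive length ends at $v$, so set $a_t^v = \infty$ for all $t \ge 1$; as the base case, $a_0^v = 0$ for every vertex $v$. Otherwise, we have the following recurrence,

$$a_t^v = \min_{u \in \mathrm{predecessors}(v)} \left(a_{t-1}^u + x_{(u,v)}\right).$$

It is easy to see that the above recurrence leads to a dynamic program on a DAG. Once we obtain an optimal solution to the LP relaxation, a simple threshold based rounding using a threshold of $1/k$ yields a k-approximation: every path of length $k$ has LP value at least 1, so at least one of its $k$ edges has $x_e \ge 1/k$ and is deleted, while the cost of the deleted edges is at most $k$ times the LP value.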
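
The separation oracle and the threshold rounding can be sketched as follows. The function names, the dictionary representation of the fractional solution, and the tolerance `eps` are assumptions made for this illustration. The sketch fills the table layer by layer in the path length, with the convention that a path of length 0 costs 0 and an unreachable entry is infinite, so it does not need the vertices in topological order.

```python
import math


def min_path_values(n, x, k):
    """a[t][v] = least total x-value of a path with exactly t edges ending at v.

    x maps each edge (u, v) to its fractional LP value; vertices are 0..n-1.
    Entries stay at infinity when no path with t edges ends at v.
    """
    a = [[math.inf] * n for _ in range(k + 1)]
    a[0] = [0.0] * n                 # the empty path ends at every vertex
    for t in range(1, k + 1):
        for (u, v), x_e in x.items():
            a[t][v] = min(a[t][v], a[t - 1][u] + x_e)
    return a


def violated_vertex(n, x, k, eps=1e-9):
    """Return a vertex ending a path of length k with x-value < 1, else None."""
    a = min_path_values(n, x, k)
    for v in range(n):
        if a[k][v] < 1 - eps:
            return v
    return None


def threshold_round(x, k, eps=1e-9):
    """Delete every edge whose LP value is at least 1/k."""
    return {e for e, x_e in x.items() if x_e >= 1.0 / k - eps}


feasible = {(0, 1): 0.5, (1, 2): 0.5, (2, 3): 0.5}
infeasible = {(0, 1): 0.25, (1, 2): 0.5, (2, 3): 0.5}
```

With `k = 2`, the first assignment satisfies every path constraint and rounds to deleting all three edges, while the second violates the constraint of the path ending at vertex 2.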

Theorem 12. The standard LP relaxation for DED(k) on n-vertex DAGs can be solved in polynomial time for $k \in \{2, \ldots, n-1\}$ and yields a k-approximation.

6.3 Hardness of Approximation 

For fixed integer $k \ge 2$ and arbitrarily small constant $\varepsilon > 0$, Svensson [18] showed factor $(k - \varepsilon)$ UGC-hardness of the vertex deletion version of DED(k), which requires deleting the minimum number of vertices from a given DAG to remove all paths with $k$ vertices. In particular, [18] proves the following structural hardness result.

Theorem 13 (Svensson [18]). For any fixed integer $t \ge 2$ and arbitrary constant $\varepsilon > 0$, assuming the UGC the following is NP-hard: Given a DAG $D(V,E)$, distinguish between the following cases:

— (Completeness): There exist $t$ disjoint subsets $V_1, \ldots, V_t \subset V$ satisfying $|V_i| \ge \frac{1-\varepsilon}{t}|V|$ and such that a subgraph induced by any $t-1$ of these subsets has no directed path of $t$ vertices.

— (Soundness): Every induced subgraph on $\varepsilon|V|$ vertices has a path with $|V|^{1-\varepsilon}$ vertices.

The following theorem provides a simple gadget reduction from the above theorem to a hardness for DED(k) on DAGs.

Theorem 14. Assuming the UGC, for any constant $k \ge 4$ and $\varepsilon > 0$, the DED(k) problem on weighted DAGs is NP-hard to approximate with a factor better than $(\lfloor k/2 \rfloor - \varepsilon)$.

Proof. Fix $t = \lfloor k/2 \rfloor$. Let $D = (V, E)$ be a hard instance from Theorem 13 for the parameter $t$ and small enough $\varepsilon > 0$. The following simple reduction yields a weighted DAG $H = (V_H, E_H)$ as an instance of DED(k). Assign $w(e) = 2|V|$ to every edge $e \in E$. Split every vertex $v \in V$ into $v_{in}$ and $v_{out}$ and add a directed edge $(v_{in}, v_{out})$ of weight 1. Also, every edge entering $v$ now enters $v_{in}$ while edges leaving $v$ now leave $v_{out}$. It is easy to see that removing all edges of weight 1 from $H$ eliminates all paths with 2 edges, implying that the optimum solution has weight at most $|V|$. Thus, we may assume that the optimum solution does not delete any edge of weight $2|V|$.
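
The vertex-splitting gadget can be written down directly. The sketch below is illustrative: the function name `split_vertices` and the tuple encoding of the copies `(v, "in")` and `(v, "out")` are hypothetical, and the heavy weight is taken to be twice the number of vertices, on the understanding that any weight exceeding the total weight of the unit edges serves the same purpose.

```python
def split_vertices(vertices, edges):
    """Build the edge weights of H from the DAG D = (vertices, edges).

    Every vertex v becomes a unit-weight edge (v, "in") -> (v, "out");
    every original edge (u, v) becomes a heavy edge (u, "out") -> (v, "in").
    """
    heavy = 2 * len(vertices)
    weights = {}
    for v in vertices:
        weights[((v, "in"), (v, "out"))] = 1
    for u, v in edges:
        weights[((u, "out"), (v, "in"))] = heavy
    return weights


# D is the path 0 -> 1 -> 2; H then has three unit edges and two heavy edges.
h_weights = split_vertices([0, 1, 2], [(0, 1), (1, 2)])
unit_edges = [e for e, w in h_weights.items() if w == 1]
```

Deleting all unit edges costs exactly the number of vertices of D, which is why no optimal solution needs a heavy edge.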

We now show that Theorem 13 implies that it is UG-hard to distinguish whether: (Completeness) $H$ has a solution of cost at most $(\frac{1}{t} + \varepsilon)|V|$, or (Soundness) $H$ has no solution of cost at most $(1-\varepsilon)|V|$. This immediately implies the desired $(t - \varepsilon') = (\lfloor k/2 \rfloor - \varepsilon')$ UGC-hardness for DED(k).

— (Completeness) There exists a subset $S \subseteq V$ of size at most $(\frac{1}{t} + \varepsilon)|V|$, such that removing $S$ eliminates all paths in $D$ of $t$ vertices. Let $S'$ denote the set of edges in $H$ corresponding to the vertices in $S$. It is easy to observe that $H(V_H, E_H \setminus S')$ has no paths of length (number of edges) $2t$. Thus, $S'$ is a feasible solution to the DED(k) problem of cost $(\frac{1}{t} + \varepsilon)|V|$.

— (Soundness) Assume for the sake of contradiction that we have an optimal solution $S' \subseteq E_H$ of cost at most $(1-\varepsilon)|V|$. Since $S'$ is an optimal solution it only has edges of weight 1, each of which corresponds to a vertex in $V$. Let $S$ denote this set of vertices in $V$. By construction, since $H(V_H, E_H \setminus S')$ has no paths with $k$ edges, $D[V \setminus S]$ has no induced paths with $\lfloor k/2 \rfloor + 1 = t+1$ vertices. Further, since $|S'| = |S| \le (1-\varepsilon)|V|$, we have $|V \setminus S| \ge \varepsilon|V|$. Thus, we have a set of size $\varepsilon|V|$ that has no induced paths of length $t+1$. This is a contradiction since every induced subgraph of $\varepsilon|V|$ vertices has a path of length $t+1$.

□